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Digital Video System and 
Methods For Providing Same 



Background 0 f the lnv>nh> n 



1. Field of tbe Invention 



Tb. present invention relates generallv t0 ^ ^ ^ ^ 
processmg of video images and, more panicularlv> to teclmjques fc ^ ^ 

video images. 

2. Description of the Related Art 

There has been a rapid evol u ,io„ from analog video technology ,„ digita, video 
■echnology because of ft. advantages tha, digital video has ,o offer. Digiu, video can be stored 
and dtsmbuted more cheaply man ar*log video because digiu, video can be s Wre d o„ random* 
access*!, media such as magnetic disc drives (hard disks, and optica! disc media known as 
compact dtscs (CDs). C™. stor* on a randomly accessible media, digiu, vide, may become 
mteracve, aHowing i„o be used in games, catalogs, training, education, and other actions. 

On. of th. newest produce to be based on digita. video technology is the digital video 
te, sometimes called "digiu, versatiie disc" or simp,y "DVD." T*ese discs are the size of an 
audio CD, ye, ho,d up to 1 7 biUion bytes of dau, 26 times me data on an audio CD DVD 
storage capacity (,7 Obyte, is much higher man CD-ROM ,«>0 Mbyte) and a DVD can deliver 
the data at a higher rate man CD-ROM. Wore, DVD technology presents a tr^ous 
movement in video and audio cuality over traditional systems such as televistons, VCRs and 

However, a major problem m utilizing DVD and other digit., video technology ,„ display 
motton ptctures is ,ha, is tha, the sources of motion pictures com. a, different fnme speeds For 
example, standard nlm is shot a. a rate of 24 Hz whil. a Revision broadcast using the National 
Telev,s,on System Committee (NTSQ standa* updates motion at 60 Hz. Convening ,h. 
motton picb.ro into digita, video tough a process called deinterlacin, often produces a 
nottcabl. reduction in resolution as well as dis.ort.ons known as motion artifacts 
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P . blem with ^ di8ilal video tehnolosy js ^ 

™us formats. For example. movies m ^ ^ „ ^ fc t ^ - 
are o«e„ much narrower. video jmage ^ convem a or 
from one orma, ,o another. For examp,,, , ^ image ^ , * 

: y 480 pixei * may have - * ~ - — - * -J* 

splayed on a pan.cu.ar dispiay device ^ „ . LCD ^ ^ , ^ ^ 
^ However, -o« poor an video scaling methods provide ^ rMO|u , io „ „ ^ 

b«w J" a °°' h 7 roWm Wi ' h iMge "*« " ' hM » -* «• of da* 

ptZT rt n0US """" differa " ^ ~ "*« - «-*• * - ^ve 

b*- , . 54 Mhz domain «, , 48 Mhz domam , 9:8 ratio) . ta 

data ^een nvo asynchronous and/or d i( Teren, data rare domains use, ^ expensive buffi 
Aether reason tha, digj.al video technology has been difB c„„ (0 uWize is because DyD 

players must he operationally anached ,o a video disp.ay, sueh as a Revision or televisZ 
«-«. *ey are vinuaiiy impost ,o use anywhere space is limned. Funhermore, 21 
-** vdeo sysrems suffer from a large number of ^ problems. Hand-held and mob* 
*«- - « «hioi, region maladies such as RF multipart mterference atd poor * 

* proven. UP. p,ayb* k capability, bu, ,hey ,end ,o he physical large a* no, Wy 

In vie* of the foregoing, i, is desirable ,o have a ponabte video player capable of beta, 
have meihod and apparatus of dein,er,acing and reforming ma, provides for preserva„o„ of 

processing while minimizing cost. It is desirable to have , m*,iw a 

c . ^auuoie io nave a method and apparatus that DroviHec 

:::=r a. while m,n im , z ,„ 8 cos, and ehmmaUng L need for ~ 
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It should be appreciated that the present invention can be implemented in numerous 
ways, including as a process, an apparatus, a system, a device or a method. Several inventive 
embodiments of the present invention are described below. 

In one embodiment of the present invention, a digital image enhancer is disclosed. The 
digital image enhancer includes a deinterlacing processor receptive to an interlaced video stream. 
The deinterlacing processor includes a first deinterlacer and a second deinterlacer and provides a 
deinterlaced video stream. The digital image enhancer also includes a video output processor 
receptive to the output of the deinterlaced video stream to provide a scaled, deinterlaced video 
stream. 

In another embodiment of the present invention, a digital image enhancer is disclosed 
including a deinterlacing processor receptive to an interlaced video stream and operative to 
provide a deinterlaced video stream. The digital image enhancer also includes a video output 
processor receptive to the output of the deinterlacing processor. The deinterlacing processor 
processes the interlaced video stream in vertical slices to provide a scaled, deinterlaced video 
stream. 

In yet another embodiment of the present invention, a portable DVD player is disclosed. 
The portable DVD player includes a generally thin prismatic enclosure having a first major 
surface, a second major surface separated from said first major surface, and side surfaces 
connecting the first major surface to the second major surface. At least a portion of the first 
major surface includes a video display, and the enclosure includes a DVD entry port such that a 
DVD can be inserted into the enclosure. 

The portable DVD player also includes a digital processing system including a decoder, 
an image enhancement engine, and a display controller. The decoder receives signals from a 
DVD inserted into the enclosure to provide a decoded, interlaced video signal. The image 
enhancement engine converts the interlaced video signal to a deinterlaced video signal. The 
display controller uses the deinterlaced video signal to provide progressively scanned video on 
said video display. 
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In yet another embodiment of the present invention, a method for processing digital video 
is disclosed. The method includes deinterlacing an interlaced video stream by at least one of a 
number of deinterlacing methods to produce a deinterlaced video stream. The method also 
includes scaling the deinterlaced video stream. 

Other aspects and advantages of the invention will become apparent from the following 
detailed description, taken in conjunction with the accompanying drawings, illustrating by way 
of example the principles of the invention. 
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The present invention will be readily understood by the following detailed description in 
conjunction with the accompanying drawings. To facilitate this description, like reference 
numerals designate like structural elements. 

Figures 1 A and IB illustrate a portable DVD player in accordance with one embodiment 
of the present invention. 

Figures 2A, 2B, and 2C illustrate several different applications for the DVD player in 
accordance with one embodiment of the present invention. 

Figure 2D illustrates a docking station and associated video monitor for the DVD player 
in accordance with one embodiment of the present invention. 

Figure 2E illustrates a "DeskTablet™" DVD player in accordance with one embodiment 
of the present invention. 

Figure 3 is a block diagram of the electronic circuitry of the DVD player in accordance 
with one embodiment of the present invention. 

Figure 4 illustrates a system diagram of the Image Enhancement Engine (IE 2 ) in 
accordance with one embodiment of the present invention. 

Figure 5 illustrates a method of combining fields into frames in a deinterlacing process in 
accordance with one embodiment of the present invention. 

Figure 6 is a block diagram of a video deinterlacer in accordance with one embodiment of 
the present invention. 

Figure 7 is a system diagram of a progressive frame detector in accordance with one 
embodiment of the present invention. 

Figure 8 is a flow diagram of the processing steps within a field-differencing module in 
accordance with one embodiment of the present invention. 

Figure 9 illustrates details of the frequency detection module in accordance with one 
embodiment of the present invention. 
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Figure 10 is a system diagram of the PFPD module in ■ i_ 

* 01 me rrru moau,e m accordance with one embodiment 

of the present invention. 

Figure 1 1 is an illustration of a deinterlace process in accordance with one embodiment 

of the present invention. 

Figure 12 shows a two-dimensional array of pixel values used to describe the present 

invention. 

Figure 13 is a diagram illustrating a method for using obtaining an output pixel from the 
two-dimensional array of Figure 12 in accordance with one embodiment of the present invention. 

Figure 14A is an illustration used to describe the method in accordance with one 
embodiment of the present invention. 

Figure 14B is a graph of a set of samples from the sampling line of Figure 14A. 



Figure 14C is a graph of a sampled cosine 



wave. 



Figure 15 is an illustration used to describe the method of thresholding a detection value 
in accordance with one embodiment of the present invention. 

Figure 16 is a block diagram of a mixing circuit in accordance with one embodiment of 
the present invention. 

Figure 17 is a diagram of an exemplary operation of the mixing circuit in accordance with 
one embodiment of the present invention when the DV is greater than "0," but less than "1." 

Figure 18 is an illustration of a method for detecting diagonal features in accordance with 
one embodiment of the present invention. 

Figure 19 is a block diagram of a diagonal mixing circuit in accordance with one 
embodiment of the present invention. 

Figure 20 is a diagram showing the pixels of secondary array used for calculating the 
output of the diagonal mixing circuit of Figure 16. 

Figure 21 is a flow chart of a diagonal detection method in accordance with one 
embodiment of the present invention. 
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Figure 22 is an example of a variable scaling FIR filter in accordance with one 
embodiment of the present invention. 

Figure 23 is a graph of low-pass filter coefficients in the time domain. 



Figure 24 is a table of coefficients organized into L sets of mults 



per set. 



Figure 25 is a flow chart of a method for quantization in accordance with one 
embodiment of the present invention. 

Figure 26 is a flow chart of a method for changing coefficients in accordance with one 
embodiment of the present invention. 

Figure 27 illustrates a video frame in accordance with one embodiment of the present 
invention which is subdivided into a number of vertical slices for a slice scanning sequence 
exemplified by a corresponding number of scan lines. 

Figure 28 illustrates an example of an initial slice core that has a problem with 
unavailable data on its left edge and right edge. 

Figure 29 illustrates a slice that has added wings along the initial slice core's left and 
right edges. 

Figure 30 illustrates an overall structure of overlapping slice/wing combinations. 

Figure 31 is a flow chart illustrating a method of processing video in accordance with one 
embodiment of the present invention. 

Figure 32 illustrates a system diagram for a slice based video processor in accordance 
with one embodiment of the present invention. 

Figure 33 illustrates a system diagram of a video processing chip architecture in 
accordance with one embodiment of the present invention. 

Figure 34 is a diagram of an asynchronous interface in accordance with one embodiment 
of the present invention. 

Figure 35 is a diagram of an alternative asynchronous interface in accordance with one 
embodiment of the present invention. 
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Figure 36 is a diagram of a 3-buffer synchronizer sequence illustrating the sequencing 
and relative relationships of read and write operations to three RAM buffers in accordance with 
one embodiment of the present invention. 

Figure 37 is a flow chart of a method for sequencing through RAM addresses and 
modules in accordance with one embodiment of the present invention. 



Figure 38 is a diagram of a synchronizer of the present invention configured for 
video scaling application in accordance with one embodiment of the present invention. 



use in a 
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Detailed rWription of th, Pr.f, f - f .. ^ w { 

ftoroul" ^ f °"T 8 deSCriP "' 0 "' ~ S " " ^ " «*' - P<°v,de . 

it u rrr ns ° m ° presMt tavemion - " wni * — • » - ««- 

l^l ^ ' nVen,te " ^ h PraC,iCKl Wi,h °" ~ " *" ° f *- *«* 

d«ai,s. In other we,, known ^ haye no( ^ ^ deta . 

order no! to unnecessarily obscun , „,,, pnJsent j nventjon 

Figures ,A and | B i,lustr«e a portable DVD p,aver ,0 in accordance with one 
embodttnen, „f fte present invention. We DVD player , 0 includes a housing ,2 fta, serves as 
an enclosure or chassis for fte component of fte DVD player 10. A screen 14 for viewing ft. 
video and con*,, buttons 16 to con.ro, fte DVD player ,0 are ,oca,ed on fte ,op face of fte 
ousmg ,, Power and signal interfaces ,3 are loca*, on one of fte two side faces o 1 

««, A DVD 24 can fit within a suitably configured recess in fte media t^rt 22. Z 
.he media transport 22 wiftdraws i„,o ft. housing 12 to permit ft, p^k ofDVD 24. 

mos, " ^ " ° f " M " b "' * ito - » «— ,o 

"0. types of exportation. The pr.cn. inven.io„ is ideally sui,ed ,o .he „ ho sp e„d s 

ong hours ,„ a passive such as on an airplane, nain. „ r subway as illustrated in Figure 

2a In addition ma^y commercial applications are possible as well, for example, video 

». also be placed on fte magazine rack of eve* Stairmaster® exercise machine and treadmil, i„ 
a fitness center setting, as illustrated in Fig. 2C. 

Wherever portable video playback is needed ,h. . 
, . p'«yoaci is needed, fte present invent on can be ftere 

*eat sound, DVD and CD playback, or TV viewing is desired, fte presen, invenn" 
- * ft. , ea, mobile so,u,ion. Crea. examples of fte versanlity of fte presen, invention uses 

oTsto" T^ ^ ^ ' - «* <«* *= f capable 

of o„„g thousands of books,, and on-fte-spo, do-it-yourself videos in topics such as auto 
repair, gardening, cooking, home projects. 

Furthermore, the pre sen« invention can be used as a portab,e video display device for 
v^eo games, camcorder piayback, or digita, stil, camera image viewing, J for £ 
applications such as video kiosks, commerca, airliners and fimess centers. The p„ n , b ,e D VD 
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player 10 can also be used to present sales content: bus, elevator, taxicab, real estate walk 
through videos, etc. It can even be utilized as a hang-it-on-the-wall flat television or display 
device - fine art discs, "dynamic art", etc. 

Figure 2D illustrates a "Home Theater Docking Station" provides an uncomplicated, one- 
step connection and operation capability for when the present invention is used as a simple DVD 
player with a normal home television set The dock provides a cabled electrical interface to a 
television or other home theater components - such as an audio system - that can remain 
permanently in place for when the present invention is used with an external system. The dock is 
preferably the same form-factor as a typical VCR; it will be designed to blend in with the rest of 
the system components that a user owns, and also be instantly recognizable for its intended 
function. 

An infrared remote control is preferably bundled with the docking station. The remote 
features a comprehensive control feature set optimized to access all of the additional enhanced 
features available with the DVD format. Central to the design of this remote control is a highly 
attractive industrial design that will carefully avoid the dreaded and difficult-to-use "sea of 
buttons" so commonly found on consumer product remotes. 

Figure 2E illustrates one embodiment in accordance with one embodiment of the present 
invention for the desktop market is referred to herein as a "DeskTablet™" DVD player. 
Applications for the DeskTablet include uses such as in the bedroom, den, or kitchen, where a 
"fixed" unit placement is needed. This product is essentially in the same marketing space as 
conventional TV/VCR combination units. Similar in form factor to the "Monorail" personal 
computer, the thin DeskTablet form factor will be capable of either freestanding or wall hanging 
operation. Freed of many of the portability-driven design constraints required of the present 
invention mobile machine, the DeskTablet will include a high-quality integrated loudspeaker 
system. 

Figure 3 is a block diagram of the drive module 26 of the DVD player 10 of Figures 1 
and 2. The drive module 26 includes the media transport 22 that reads the DVD. Video data 
from the DVD is then transferred over to a MPEG/Dolby digital (or "MPEG/AC-3") decoder 28. 
After decoding, an Image Enhancement Engine™ (IE J ) 30 deinterlaces the digital video to 
provide a progressively scanned video signal. Finally, the video is displayed through a display 



36. 
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a ban. ^ m0dU ' e M '"'"^ " aUdi0/taftared ** » « -.roller 34. 

a ^ * ' **■ *" bu,rer * ad • - * — — 

illustrated components, busses and suh-cvct^e tk. 

f usses, and sub-systems. The components of the DVD drive module 26 

are pnmanly housed within the housing 12 and will each be described in detail below. 

The DVD tra^port 22 prefembly uses an off-the-shelf drive module 26 designed for 
portable cotnptnc, Preferably, the drive module 26 is seventeen mm or less in thickness 

^ °* «"» ■» fM °'- A —* ^ drive module 26 is .viable ta 
Toshtba Arnica ^formation Systems, Inc. of Tokyo, Japan. Fimher , me ^ ^ 26 

me present invent is preferably physical* and elecfronically isolafcd from mechanical shocks 
and .mpacts. More particularly, the drive module 26 is mechanically shock mounted i„ me 
housing ,2. artd da* from the drive module 26 is buffered in memoty for a period of time before 
decodtng ,„ allow sector ,0 be reread in the even, of a shock-induced data stream discontinuity. 

The MPEG/Dolby digita, decoder 28 is preferably an off-the-shelf single^ decker 
argeted a, consumer DVD players. ., prefercbly perf onm both MPEG-1 and MPEG-2 
^.mg Dolby digita, rAC ^, MPEG, and LPCM audio decoding, has A/V synchrotron, 
- 1 has me characteristic that only a single ™ 7 device is required for data storage and 
buffenng. Such decode, are avai,ab,e from a variety of sources, including C Cube 
Microsystems of Milpitas, California. 

The ie 30 prefer deinterlaces digital video from DVD or any other source to provide 
progressrvely panned video s if mal, including propriety motion detection »d compensation 

otr f T" prefenb,y -~ — ■ - — ~ 

conecnon for *. and accurate video display. To frrrther imp™ th, image, a high quality 
«*, engme , the * 30 is preferably used * convert video image from rec^gularToU 
'° * *» "" • **W on flat pane, sizes other mar, 640 x 480 

The audio/IR link 32 (comprising on. or more intend circuit chips, decodes ft. Dolby 
igKa, daa aream provided from MPBG2/Do,by digital decoder 28, mixes 5.1 chamtels down to 
channels for conventional stereo output, arrf erodes/processes 5. 1 channels for sunound 
headphone output (optional module,. Stereo D/A conveners are provided for headphone output. 

D/A converters for 5. 1 channe! Dolby digtul stream are available on the docking mojule 
connector. An undecoded Dolby digita. stream for outboard decoders is a,so avai,ab,e on the 
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docking module connector. An optional IR transmitter for wireless headphones may be 
prodded, as may stereo speakers with small stereo power amp for presentations or playback 

without headphones. 

The system controller 34 is preferably a single-chip microprocessor handling most, if not 
all, system control functions. For example, the microprocessor preferably handles system startup 
and configuration, user interface & controls, feature set selection (e.g., parental control, etc ) 
DVD drive control, and TE\ MPEG decoder, audio system, and display controller configuration.' 
A sujtable microprocessor is available as part number MC68HC16 from Motorola, Inc. of 
Austin, Texas. 

The display 36 is preferably an 11.3" LCD display (both active and passive matrix 
models available) with high-output fluorescent cold cathode backlight. The display 36 
preferably has 640 x 480 pixel resolution and 18-bit color depth. A suitable display is available 
from Sharp Electronics Corp. of Camas, Washington. The video controller for the display 36 
provides high-resolution, flexible onscreen graphics, overlay of graphics onto ful.-screen video 
playback, and LCD drivers for direct connection to display. 

The housing 12 is preferably made with a "tablet" form factor, which is easy to use and 
carry. The single piece housing design of the enclosure provides simplicity, ease of manufacture 
ruggedness, reduced weight, and serviceability. An optional docking station allows simple one- 
connection hookup to external video systems. 

The battery pack 38 is preferably a replaceable, rechargeable module based upon NiMH 
technology for high power density at low cost. Preferably the battery pack uses standard off- 
the-shelf battery cells having a 40 watt-hour capacity which provides sufficient power for over 
2.5 hours of continuous operation. This period of time is normally sufficient to view a full, 

feature-length movie. 

Also shown are inputs and outputs to/from the unit. While interconnections from the 
system controller 34 to other components are shown as a bus, these may in fact be realized via 
d.screte connections if appropriate for the specific off-the-shelf components chosen The 
architecture of the DVD drive module 26 has been designed for modularity of components and 
ease of expandability. 
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34. This interne ~ J I I t " " *• «- — 

« can b. buil. by Sizing a sma,,,, ^ ™-* 1 "">* <°"«P« 

— — ***** *** while „, JTL^T h C0 " ,I0 " er *"* 

conations po^e by ^ ^ «*» ^nconed base 

functtonality. For example, a u„i, wilh , ,„,. * , , ' P c °™°°'*<><* for added 

- — of d/a ~:::r ir rrr":^ 0 ^ 

modifications. connectors, and enclosure 



The system software of the portable DVD plaver of ih. 
aruc««d into two main levels- the hi.h , ■ n,Venti ' M " 

con.0, software. The softwat m „ T "* **• 

on a read only ™T " ** ^ ""^ * <■ "ored 

vanous hJar, ZponZ m ™ ^ '° W " ,eVe ' ^ — " *«* «* *• 

provtdes for power^Z "t ^ ta « ' ■** " 

^ « iur power-on and initialization, configuration nftu» ^ 

— ' - ~- - ^erface . ZT^ZT^ ^ 

Pro JZL^'' ^ — ' Appiication 

l,y WIin 0,6 typical user controls - e o "„i,„ - , • 
"show elapsed time " etc Th. , nnt , * • P y * sk, P t0 "«t track," 

• - -Lands •JII; ^.^^l": * " ~ — — 

-are to a^rmation ^ ^ ^ - - - in.er.ace 

proVideTa ^'ZZT ^ ""^ software. The Status- 

,C<U1 oreaJc between the user interface aP .H *k . . . 

lowing ^ user imerface ^ *• **• »nm,l softw^, 

interface, e.g., a customs user interface, to be substituted into 
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p,ayer's stimdard software. « « ^ be apprecialed ^ fc ^ ^ ^ 

7 mVen,i0n ^ 3 ^ ° f ^figurations, lowing a rich produc. fanti.y 

with excellent longevity and scalability. 

A power supply 40 taw. . Dc. lo . DC „„„„ ^ ^ ^ ^ 

ba,.ery or externa. DC source. He DVD p,a y er ,0 preferably derives a„ required internal 
voltages from a single DC power source. A bigh voluge inverter provides power needed for 
d,sp,ay back Hght. A charger (no, shown) is used ,„ re-charge fte interr*, rechargeable batiery. 
The use, interface includes a straightforward con.ro, snip „„ ft. fron, bezel for quick access to 
common tactions, and ft. software us^-imerfac. allows auHte«d. ye, simple and intuitive 
Player control. Software Contois rendered via rugh-resolution graphics can overlay via* for 
immediate user feedback. 

Figure 4 iHustia.es a system diagram of fte ^ Enhancemenl ^ ^ 
Figure 3) ,„ accordance wift one embodimen, of .he presen. invention. The IE> 30 includes a 
«. dmtensional video proofing chip archie 50 and a video outpu, processor 60. The 
chm arcmtecure 50 includes a ft, defacing suge 70 and , second dei„,er,aci„ g s,age 80 
and a s« of addressing and sentencing FIFOs 90. The firs, deinterlacing suge 70 includes 
progress,,, frame sequence detection a*, field difference processing. The second defacing 
suge 80 includes vertical fluency detection, sip, reversal detection and diagonal fa*, 
detection The video outpu. processor 60 includes horizons, scaling, color space conversion, , 
lo 6 bpp dilhenng and gamma, conlraa, and brightness corrections. 

The firs, deinterlacing stege 70 of Figure 4 is discussed in greater de,ail wi,h reference ,o 
F.gures 5-10. Figure 5 illusfrate, a meftod ,00 of combining fields in,o frames in a 
placing process in accordance wift one embodimen, of fte presen, invention. A s^ies of 
film frames 1 02 a, 24 rW per second are averted te video fields ,04 a, 60 fields per second 
The vdeo fields ,04 are ft. converted » deinteriaced frames ,06 by a dein.er.acin g process 
The de,»,erlaced frames ,06a ate created by combming fte .wo mos. r^en, fidds in.o a si„g,e 
frame, bu, fte ^aced frames , 06b are created by combination of differen. video fields and 
distinguished by bold arrows and bold borders. 

Referring ,o F.gure 5, when Field 2 is fte "currenf fie ld , ft eri DI Frame , js 
combmmg Field 1 (,he "las,- field, and Field 2 (fte "current" field,. DI Frame 2 is created in fte 
-me way by combining Fie,d 3 wift Field 2 when Field 3 is fte curren, fi.,d. ,„ comras, when 
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Field 4 is the current field, combining Field 4 and Field 5 creates DI Frame 3. In this case, the 
combination is the "current" and the "next" fields. When the field combination is done in this 
way, all of the deinterlaced frames 106 will be created by combining fields that originate from 
the same source frame. Because these combining fields originate from the same source frame, 
they are time correlated. Therefore, no motion artifacts exist in the deinterlaced frames 106. 

Combining fields into frames as shown in Figure 5 requires identification of the type of 
motion picture used in the original source. First, a source must be identified to be one in which 
original progressive frames exist; second, the sequencing used to create fields from the frames of 
the original source must be determined in order to assemble video fields into frames and avoid 
artifacts. 

Figure 6 is a block diagram of a video deinterlacer 130 in accordance with one 
embodiment of the present invention. A digital video stream enters the deinterlacer 130 through 
a FIFO memory module 132 before being written into a digital memory unit 134. The digital 
memory unit 134 has the capacity to store four complete video fields in a set of field buffers 
134a-d. The incoming field is written to each of the field buffers 134a-d in sequence. Therefore, 
the first incoming video field is written to field buffer 134a, the second incoming video field is 
written to field buffer 134b, etc. After field buffer 134d is filled, the next incoming video field is 
written to field buffer 1 34a again. 

During the time period of one field, the incoming field is written to one field buffer 134, 
and the three previous fields are read from the other field buffers 134. For example, if as shown 
in Figure 6, the incoming field is written to 134a, then field buffers 134b-d are being read into a 
set of FIFO memories 136, 138, and 140. The FIFO memories 136, 138 and 140 are provided to 
accommodate the reading and writing of the four video fields into a single physical memory unit 
134, and also to decouple the clocking domains of the incoming video, the memory unit 134, and 
the subsequent deinterlace processing stages. 

The labeling of the field buffers in 134a-d as "incoming," "1st," 2nd," and "3rd" previous 
fields are from the point of view of the incoming video streams. The outputs of the three read 
FIFOs 136, 138, and 140 are labeled Next Field, Current Field, and Last Field respectively. This 
labeling is from the point of view of the deinterlace processor 130 and implies that deinterlace 
processing imposes a delay of approximately two field time periods on the video stream. 
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The outputs of FIFOs 136, 138, and 140 are synchronized so that the three fields 
presented to subsequent processing stages are spatially concurrent. The three fields are then 
presented as inputs to an inter-field progressive frame detector 142 which produces a last/next 
(L/N) signal 144, a progressive frame detection (PFD) signal 146 and a stage 1 detection value 
5 (StagelDV) 148. The UN signal 144 is a control signal that instructs a field assembly stage 150 
to assemble the Current Field with the Last Field or the Next Field, which are three sequential 
fields from the input video stream. 

With the three fields, the field assembly stage 1 50 produces a progressively formatted 
output frame 1 52. If the Current Field is even, then the Next and Last Fields must be odd and 
10 vice versa. Therefore, the progressively formatted frame 152 output of the field assembly stage 
150 will always be a combination of one even and one odd field. This is important because the 
correct assembly of fields into progressive frames requires one even field and one odd field for 
input. 

The optional intra- frame deinterlacer 154 may provide additional processing to remove 
15 artifacts that occur within the output frame 152 under certain conditions, such as when the PFD 
signal 146 is de-asserted. If the frame detector 142 detects that the incoming video signals were 
originally from a source that contains progressive frames, the PFD signal 146 is asserted. Three 
original types of source video are detected: film at 24 frames per second, computer animations or 
frames rendered at 30 frames per second, or still images, in which no motion occurs in the image 
20 over a period of several fields. When the PFD signal 146 is asserted, the optional processor 154 
is disabled. 

However, if the frame detector 142 is unable to detect a progressive frame sequence from 
the incoming video fields, then it will set the L/N signal 144 to always assemble Current and 
Last fields in the field assembly stage 150. Then, the frame detector 142 de-asserts the PFD 
25 signal 146, which informs the optional deinterlacer 1 54 that artifacts may be present in the output 
frame 1 52 and that further processing may be necessary to remove the artifacts and create a final 
progressive output frame 152'. 

Detection of a progressive frame sequence requires keeping a history of the preceding 
fields. However, the progressive frame detector 142 can look only one field ahead. In cases 
30 where a large transition in the image occurs* it is possible for the progressive frame sequence to 
change because of edits in the original video source. These changes could require changes in the 
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way frames are created from income Mds , but it ls not ^ _ ^ rf 

amval of the incoming fields. 

For mis reason, me progressive frame detoctor .42 mus, be capable of detocting 
.ransmons in ,he morion pic,„re because tiansitions may i„dica,e edi<s ma, may change me 
progressive field sequencing. If a transition is dereaed, ,h« progressive Same detector 14' win 
de-assert me PFD signal ,46 for me time period required to determine me new progressive "frame 
sequence. This allows motion artifacts be removed by me optional deimerlacer 154 during the 
Ume penod needed ,o reacquire the progressive frame sequence. In eases where a progressive 
frame sequence is no, detected and me PFD ourpu, is no, asserted, the StagelDV ,48 output 
contams mag»i,ude information re,a,ed ,o m. amoun, „f motion presen, i„ „, irnage . m , 
mformation may be used in me option, deinfcrlacer ,54 to further process me image. 

Figure 7 is a system diagram of a progressive frame detector 142 in accords wim one 
embodtmen, of me pre^n, invention. The flam, detector ,42 memoes a field differencing 

I* '* ' «* * '* - ■ P~e frame pattern detection 

(PFPD) modu,e ,58. The field ditTercnctog module ,54 calculates d. difference between a Nex, 
Fteld ,60 and a Las, Field 162. processes me differences into m. SUge.DV ,48, a transition 
detection 3:2 value 1 66, and a phrfl, of equal field history bite 168. 

The frequency detection module 156 combines a Cunen, Field 164 and me Las, Field 
*2 mto a frame a* men dace* m, vertical high frequencies ma. resu,, from motion artifaca 

Then, ft. frequency de.««ion modu,e ,56 outputs a number of frequ^cy de.ec.ion history bits 
70 and a transition de.ec.ion 2:2 value ,72. FinaUy, m, PFPD ,58 ukes as inpu, me tiansition 

d.,ec„on 3:2 value 166, me p,„ra,i,y of equal field histot, bits 168, me frequency d.,ection 

htstory bus ,70 a* fte transition deletion 2:2 value ,72 to produce die L/N signal ,44 and the 

PFD signal 146. 

Figure 8 is , flow diagram of me processing stops within ,„. fiCd-differencing module 
154 ,„ accordance wim on. embodiment of the presen, invention. A Nex, amy of pixels 174 
wh,=h ts a subse, o„he Nex, Field ,60. and a Las, array of pixels ,76, which is a subset of the 
Las. F,.ld ,62 are me inpu* to a difference, ,78. The Nex, and Las. pixel anays 174 and ,76 
can be v.ewed as windows moving across meir respective fields. The "wmdow" is moved from 
M .0 ngh. and top ,o bonom. Each time me windows are moved, a new difference is computed 
The result of ,he difference operation 1 78 is an array of differences 1 80 



17 



WO 99/18727 

PCT/US98/21069 

Using a weighted average of the airay of differences 180, the StagelDV 148 is computed. 
The weighting is such that the difference values near the center of the array of differences have 
the most influence on the weighted average. The array of differences 180 is also input into a 
field difference accumulator 182 that sums the differences over the entire field to produce a field 
difference value 184. The previous five field difference values are stored in a memory module 
1 86 and are summed in an operation 1 88. 

The sum of the previous five field difference values is compared to the current field 
difference value in an operation 190, and the result is the Transition 3:2 output signal 192. The 
current field difference value 184 is compared in a threshold operation 194 with the value in a 
programmable field difference register 196. The result of the threshold operation 194 is an equal 
field bit 198, which is a single bit that indicates that the Next Field 160 and Last Field 162 are 
the same. The previous equal field history bits 1 68 are stored in a memory module 200 and are 
used in the PFPD 158 of Figure 7. 

Figure 9 illustrates details of the frequency detection module 156 in accordance with one 
embodiment of the present invention. Vertically adjacent pixels 206 from the Current Field 164 
and the Last Field 162 are assembled, as they would appear spatially, on a display. A frequency 
detection value is calculated in an operation 208. This calculation is performed to detect the 
frequencies that are associated with deinterlaced motion artifacts. In an operation 210, the output 
of the frequency detection is compared with a programmable threshold value 212. The results of 
five adjacent frequency detection values are stored in a memory module 214 and are summed in 
an operation 216. 

The output of the operation 216 is accumulated over the entire field period in a field 
frequency detection accumulator 2 1 8 to produce a field frequency detection value 220. The field 
frequency detection value 220 is stored in a memory module 222 that contains the previous 5 
field frequency detection values. The five previous field frequency detection values are summed 
in an operation 224 and the results are compared to the current frequency detection value 220 in 
an operation 226. The result of this comparison in operation 226 is the Transition 2:2 bit 228, 
which indicates that a transition has occurred in the 2:2 sequence. 

As part of a parallel process, a first previous field detection value 230 that was stored in 
the memory module 222 is transferred to a multiplier 232, and multiplied with a value stored in a 
programmable field frequency threshold register 234. The result of the multiplication is 
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compared in an operation 236 to the current field frequency detection value 220. The result is a 
relative frequency difference bit 238, which is then stored in a memory module 240. The 
previous ten relative frequency difference bits 242 are output to the PFPD module 158. 

Figure 10 is a system diagram of the PFPD module 158 in accordance with one 
embodiment of the present invention. The PFPD module 158 performs logical operations on a 
set of field difference history bits 244, the frequency detection history bits 242, the transition 3:2 
value 192 (see Figure 8), and the transition 2:2 value 228 (see Figure 9). After the input of the 
field difference history bits 244, a logical operation 246 determines the 3:2 pulldown detection 
bit by looking for patterns in which every fifth field is equal. Then, a logical operation 248 
detects still images by setting the STILL bit when the most recent four field differences are 
zeros. The state of the L/N control signal is set by a logical operation 250. 

From the input of the frequency detection history bits 242, a logical operation 252 detects 
a 2:2 pulldown detection bit by searching for alternating patterns of high frequency and low 
frequencies in successive field times and monitoring the frequency detection history bits 242. 
Then, a logical operation 254 determines the L/N control signal for the 2:2 pulldown case. The 
PFD signal 146 is determined from the 3:2 pulldown detection bit, the transition 3:2 value 192, 
the 2:2 pulldown bit, and the transition 2:2 value 228. 

Three source types of progressive frames are detected. Film converted to video using 3:2 
pulldown is detected by computing the difference between pairs of even fields or pairs of odd 
fields and looking for the condition in which every fifth difference is zero. This condition is 
indicated by the 3:2 pulldown signal. Computer generated animations using 2:2 pulldown are 
detected using a frequency detection method which looks for the frequencies associated with 
motion artifacts in every second combined frame, indicated by the 2:2 pulldown signal. Still 
images are detected when the field differences are zero for several consecutive fields, indicated 
by the STILL signal. 

Progressive frames are simply the logical OR of these three signals as shown by a logical 
OR gate 256. However, transitions must also be taken into account. As mentioned above, a 
transition is a large change in the field sequence that results from edits, or dropout of the video 
signal. If a large change is detected, then progressive frame detection, which depends on a 
history of several fields, may be unreliable for the period required to establish a progressive 
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frame pattern. In the examples illustrated, this period is ten field times or about one-sixth of a 
second for 60 Hz fields. 

In a set of logical operation 258, a pulse is generated under two conditions. One is that a 
3:2 pulldown sequence is detected and a transition occurs; the second is a 2:2 pulldown sequence 
is detected and a transition is detected in the sequence. Either of these cases triggers a timer 260 
which generates a pulse of a ten field duration, since ten fields is the time period required to 
establish a new progressive sequence. During this time, a logical AND gate 262 disables the 
PFD bit 146. As mentioned previously, if the PFD bit 146 is not asserted, then the optional 
intra-frame deinterlace processor 154 (illustrated in Figure 6) may remove motion artifacts 
during the timeout period. 

It will therefore be appreciated that the progressive frame detecting process of the present 
invention provides for elimination of edge artifacts in a video image. This is accomplished by 
identifying the type of the original motion picture and using that information to help combine 
video fields in the deinterlacing process. The combination of these techniques provides a low- 
artifact, high-resolution deinterlaced image. 

The second deinterlacing stage 80 of Figure 4 is discussed in greater detail with reference 
to Figures 11-21. Figure 11 illustrates a method 300 of deinterlace processing. A video field 
302 containing scan lines 306, and a previous video field 304 containing scan lines 308 is fed 
into a field combination deinterlace processor 310. Each of the video fields is 720 pixels by 280 
Pixels. The result is a 720 pixels by 480 pixels combined frame 312 with scan lines 314 sourced 
from video field 302 and scan lines 316 sourced from video field 304. 

When this simple deinterlacing of the prior art is performed, and a motion picture 
formatted for an interlace display is converted to a progressive format, a noticeable "artifact" or 
error arises because the image content of vertically adjacent lines is time shifted by 1/60* second 
as noted previously. The error is most visible around the edges of objects that are in motion. 

Figure 12 shows a two-dimensional array of pixel values 318 that is a subset of the 
combined frame 312 of Figure 1 1 that will be used to describe the present invention by way of 
example. The array of pixels 318 is shown having a width of 5 and a height of 7 pixels. The 
array 318 is labeled across the top CO to C4 indicating columns and is labeled vertically along 
the left side from the top to bottom R0 to R6 indicating rows. The pixels contained in array 318 
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■-^.iWW.* -"^..he^y 3,8 is also used ,„ detect 
diagonal features and finally 10 compute the resulting pixel. 

The array 3,8 is positioned so th a , . „ of even numberMj rows ^ ^ 
the most recent or -current" field of the original source, and a se, of odd numbered rows 322 
conutn pixels from the previous fie,d. The array 3 1 8 is then stepped across the combined frame 
3 2 (see F,gu« , , ) from left to „gh, horizontal* Each step causes the pixels in each of columns 
CI C2. and C3 and C4 to shift to the column to its immediate left. The pixels in column CO 
sh.fl out of the array 318, and a new column of pixels shifts into column C4. 

After the array 3,8 has been stepped across a„ the horizontal positions of combined 
*-» 312, n ,s stepped down verticaHy by two pix„ s and rerurned to the M side of the 

combtned frame 312. Therefore, even numbered rows 320 contain pixels ta, „,„ . 
k»ia n a jj « ^nuun pixels from the most recent 

field and odd numbered lines 322 contain pixels from the pre™. fie,d. The process men 
epeats ttseif as array 3,8 is then stepped across the combined frame 3,2 again from ,eft to right 
honzontally. At each position in the ^dimensional array, a detection value (DV, is cafculatL, 

Figure 13 Hlustrates a method 326 for obtaining an output pixel 338 from the two- 
dunenstona, array 3,8. In an „per«i„„ 328, a frequency detection value is obtain* using u,e 
seven ptxels of each column of the twc-dimensiona, array 318. Became there are five column 

rr z "rr de,ection °~ produci " s ' ^ ° f ~ — 

HI. fd2, fd3. and fd4. Next, an open-on 330 ,hresho,ds the se, of detection values f d0-fd4 
The* ,n an operation 332, the set of detect™ values fd0-fd4 is combined in a weighted average 
to amve at an ultimate detection value (UDV) 334. 

-wo + (, .„,*„. fd2) + (2 . fd3) + fd4y]4 ^ wei6htjn8 mums 

values closes, to me center of array 3 , 8 to have the greatest influence on the UDV 334 In this 
way, using five normally adjacent frequency detection values results in a low pass filtering 
operatton providing smoother transitions benveen area, within the combtned frame 3.2 where 

motion artifacts do and do not exist. 

The UDV 334 computed in operation 332 ,s used to control a mixing operation 336 
w^ch preferably implements ,h. following equation: pixelou, - (UDV • (pR2C2 ♦ pR4C2y2) ♦' 
«' " UDV) PR3C2, where pixelou, is the new output pixel of the deinterlacing operati „ 
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PR2C2 is a pixel in the array 3 1 8 at location Row 2. Column 2, pR4C2 is a pixel in the array 3 1 8 
at location Row 4, Column 2, and P R3C2 is a pixel in the array 3 1 8 at location Row 3, Column 
2. 

The result of the mixing operation is that the new value of pixel pR3C2 of the array 3 1 8 
depends on the UDV 334. If no motion is detected by the calculation of the UDV 334, then the 
pixel at P R3C2 will be the unmodified value of the pixel at that position in the previous field If 
a large UDV 334, a value of 1 results, then a strong motion artifact has been detected, and 
the value of pR3C2 is computed by averaging the values of pR2C3 and P R4C3 of the array 318 
The averaged result will not show motion artifacts because it is created from values of the most 
recent field that are time correlated with the most recent field. Detection values that are between 
0 and 1 will cause the pixel at pR3C2 to be a mix of pR3C2 and the average of pR2C3 and 
pR4C3. 

Figure 14A illustrates an image 340 showing operation 328 in greater detail. The image 
340 shows the computation of a single frequency detection value for one column of array 318 
Image 340 includes a distorted object 342 which is etTected by an interlace motion artifact The 
image is sampled along a line 344, which is shown for exemplary purposes. This sampling 
corresponds to one of the columns in the two^imensional array 318. In this example, line 344 
passes through an area where artifacts exist, but in general, a sampling of vertical adjacent pixels 
may or may not contain artifacts. 

m 

Figure 14B is a graph 346 of a set of samples 348 obtained by sampling along line 344 of 
F lg ure 14A. The set of samples 348 are plotted with the row numbers along the horizontal axis 
and the brightness^ intensity of the pixel along the vertical axis. From graph 346, it is apparent^ 
that the areas where motion artifacts exist, such as the set of samples 348, will show a 
characteristic frequency. This is frequency in space rather than in time and is most conveniently 
expressed as cycles per line rather than cycles per second or Hertz. The characteristic frequency 
is 1 cycle/2 lines or 0.5 cycle/line. 

Figure 14C is a graph of a sampled cosine wave 350. The characteristic frequency 
created by the motion artifact is detected by multiplying the set of samples 348 by the sampled 
cosme wave 350. The sampled cosine wave 350 has a frequency equal to the characteristic 
frequency of the motion artifact. Then, the result is integrated using the following equation- 



22 



WO 99/1 8727 

PCT/US98/2I069 



A-6 

f d = Z r W cos(2/?^r * 0.5cycle /line) 



where fd ,s the frequency detection value for one column of array 318 R is a I' H 
corresponding to the RO... R 6 of ^ 318 ^ has ^ ^ , " 3 '~ 

vertically adjacent samples 348. W ,S ^ set of 

The expression cos (2nR * 0 5 cvr i*/im.\ it 

R = 1 3 anH s , „ } P ^ * 1 R = °« 2 > 4. and 6 and - 1 for 

*• 3 ' 3,1(1 5 - If 1 «l -1 are substituted for RO R6 the , 

becomes: fd - (Y6/2 + Y4 + Y2 + Y0/2) . ^ + Y3 " N ^ 
2 -ause the mte^on ,s over the lL£ to The ^ ^ 7 " ^ * 
Abs(fd, ^method 32 6 of Figure 13 ,s repeated for each I V 
sel of frequency detection values 330. ' produc,n 8 *« 

Figure 1 5 is a graph 352 iliusfraUng Showing operation 330 in 

is a number in the range 0 to 1 Grmh «•>■•.. ^ h fd 

geutot. Graph 352 includes a non-thresholded scale 354 s™, i. 
values are threshold* to the thresbolded scale 35* n. ,. ,,• 

point 358 ,„ „. V a lu e of . " ' k \ * " ^ 

value of 0 Values J^ l ^j" 63 w ^ e ' ower "reshold point 360 are set to a 

Thresholding Jlt^Z Tn " ^ " "» "» ° ""' 

^^^^ng^tionrtdf^rptfd.LTHwrrru u 

11 JZ0 >> ^ 1 W is the lower threshold valn^ an w i rm ■ « 

threshold value. If tfd > l.o, then tfd = 1 0 o,h w L ^ ** 15 ** W 

•w, men im - l.o. Otherwjse, if tfd < 0 then tfd = 0. 

Provides'^ ^cTIrrt *" ^ °' - <— 

image. 41^1117 ' Cre ""' 8 **• ^ m ° W " 8 " ' — 

and en,™: Z T^Z "T T ^ " ^ " 

The UDV 334 is preferably US ed a. in . mixi ^ 

"7* C '7' 400 ~ « - ^ * ■»= Pixe, a, locatton R3C2 of J£ \ 
motion anifacts are detected, thenthe vaiue of the ITOV 334 w,„ be "0" 1 " 

400 will output the original oixel Rio if. . n " X '° 8 Circuil 

ongtnal p,xel R3C2. If the value of the UDV 334 is "1 .•• ^ ^ mixjng 
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circuit 400 wil, output the avemg, of to pixds ^ ^ W()w so 
mixing circuit 400 is the average of R2C2 and R4C2. 

Figure ,7 is a diagram of an exempt operation of the mixing circuil 4M wtal ^ 
UDV 334 ,s *ea,er than V bu, less than - ^ mjxing cjrcuj , ^ ^ 

the tt.ee.pue, array 402 by blending R3C2, and the average „f R2 C 2 and R4C2 to form a new 
output pixel 406 at location R3C2. 

Figure 18 is an illusion „f . method m for Mf ^ ^ a ^ 
-ay 410 that is a subs« of array 3,8 is input into a diagonal detection circuit 4,2 w« 
operas in pa^le, to the tnethod 326 of Fig^e ,3. ,f no diagona, feature is detected, then the 
diagon* eteetion circuit 412 prcduces no output. Howler, if a diagona, feature is detected, the 
diagona, ejection circuit 4,2 produces two outputs: a single bi, Si^ signal 4,4 and a multiple 
b , Slo^Fade signal 4,6. The specific method for ca,cu,a,ng the Si*, and S,opeFade signals 
414 and 416 is shown in Figure 21 and its corresponding description. 

The Sign si^ 4,4 is used to determine which pair of pixels is diagonal,, adjacent ,. 

M . h , ™ ° f *• ° ( • *P- »— 

Magnin.de ,s determined by the amount of contrast a,.ng th. diagonal feature. High contrast, 

«* « a diagond white line across a b,ac k background, wil, result in the highest vaiues of die 
SlopeFade signal 4,6. A lower contrast results in a lower value for the SlopeFade signal 416. 

Figure 19 is a block diagram of a diagona, mixing circuit 4,8 in accords with one 
embodiment of the present invention. The diagona, mixing circuit 4,8 inCudes a multip.exer 
4 0, a a, 4J2- a ^ ^ m ^ ^ on die Sign signa, 

4 4 to determine which pair of diago»a„y adjacent pixels a. used. After a pair of diagona,,, 
adjacent pixels is chose, the nrs, mixer 422 b,ends ttie pixel values tha, are vertically adj J 

0 RJO with .hose that are diagonally adjacent to R3C2. The amount of blending is determined 

1 d^iT ^ 416> ^ " PrOPOni ° Ml 10 ° f di ^ Ml »-» 

The second mixe, 424 is the final mixing stage and is identica, to me mixing circuit 400 
^own m Figure 16 . The second mixer 424 produces an output that is determined by input pi xC 
MC2 a* the outpu, of the first mixer 422. Th. UDV 334 is the control input for second mixer 
424. in summa^, the new pixel value a, R3C2 is compuled from pixel values from ,he 
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sectary array The control signal, for delCTnining fina , 
the S,gn signal 414 and the SlopeFade sigM | 416. 

ourpu, 7Z T ' 7™ Sh ° Wi " 8 ™ - for calculating U,e 

outpu of ,he dragona, miMg circllil 418 . ]f „„ ^ ^ « 

rrr. ?; oo ' pu * of ^ mixms ** - - - ** — 

diag nally a^acen, * «C2 along a line 428 or a line 430 are used » calculate „. outpul pixe , 
The Sign signal 414 is used lo determine which line 428 or 430 is used. 

Figure 21 is a flow char, of a diagonal detecdon men,* 432 in accordant ^ one 
embodiment „ accordance ^ one rf ^ ^ 

* ws the flow of logical and mamematical operarions used ,o cor„ pu ,e to SlopeFade signa, 
^ the S,gn signal 4,4 from me seconda* array 4,0. The comer pixels are divided £1 

«. pair, and two venica, pairs h y an operation 434. The horizontal pai rc are laheled hv 
an 4 and the two vertical pairs are labeled w2 „d w4. Differences are cmputed for ^ 

nonzonta, and vertical difference vector for the secondary artay 4,0. operln 
computes the ahsolute value of die horizon* a^ vertical difference vectors. A threshltag 
values used ,o adjust the magnitude of the SlopeFade output 4,o in « operation 440 2 
mm of operation 440 is an unified SlopeFade signa, ( u„ Q ua,S,opeFade, ma, ,s Z 

22 * 7 "T our by - DiagDe,M ^ - - Sto ^ » * 

parallel operations of the method 432. 

The signs of the horizontal and vertical difference, from operation 434 are recorded and 
-re ,„ an operation 442. The s ig ns mdicate wheher ,he operation 434 result in posiZ 

S.o P «DisQual is "0." ° ~ "°' to *- 
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ln operation 444. the diagonal toector looks for ^ ^ 
.arse; ,n particular, th, feaotre must be larger ^ me nine . p ,, e , ^ 4 
,nput ,o,he d.agonal processor. Image , eatnres to „ ^ fc 

ause u,e .agonal pressing „ inconeclly de,ec, a diagonal fea,ure. These small feMurK ^ 
be detected by observing tile signs and setting SlopeDisQual accordingly. 

An operation 446 comparcs the magnitude of ^ ^ 

computed ,„ op^tion 438 to detect a diagonal feature, Then, the DiagDelMt signa| „ 
in an operation 446 using the horizontal and vertical vectors. The ratio of the difference of ft. 
honzontal and vertiea, vectors and the sum of ft. horizonta, and veruca! vectors is adjusted by a 

threshold value, diagDetThresh. 

A final SlopeFade output is computed in an operation 448 in which two single bi, inputs 
are used ,0 ouality the SlopeFade output The ft* hi, is . DiagonalDetec, hi, and the second bi, 
» . SlopeDisQual bit SlopeFade will be zero if DiagDetec, is 0 or if SlopeDisQual is a , 
o^SlopeFade wil, take fte value of uoQuaKlopeFade. The SlopeDisQual signal changes' 
^SlopeFade s,gnal ,0 zero for cases where the slope of fte diagonal feanne cannot he reliably 

Finally, ft. Sign signal 4.4 is compu,ed in a, operaUon 450 using the sign bits produced 
by ,he sums of the horizontal and vertiea, vectors performed in opera,ion 436. The Sign signal 
414 » computed using this logical operation ,0 determine the s,ope of ,he diagonal feantre. 

The video ourpu, processor 60 of Figwe 4 is discussed in greater deui. wim reference to 
Hgures 22-27. F,gure 22 shows an example of a variable scaling FIR filter 500. The variable 
seating FIR filter 500 includes a shift register 502 with a series of registers 504 each of which is 
«~M to a clock 506. Each register 504 is connected ,0 one of a se, of multipliers 508. 5,0 
and 5 2 by one of a series of filter taps 514. Multipliers 508. 5,0. artd 5,2 accept two inputs ,„ 
be mu tiphed. The firs, mput is an eight-hi, data word, and the second input is a coefficient 
Multiphers 508. 5,0. and 512 differ from each other in tha, t„cy ^ coefricienls „ „ 
«W numbers of hi,, Mu„ip,iers 508 use ft. leas, number of bits per coefficient and 
"■ultiphers 5,2 use the mos, bits per coefficient. Multipliers 508. 5,0. and 5,2 are connected .0 
a coefficient storage unit 5 1 6 and an adder 518. 
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Eight bits of data are input into the variable scaling FIR filter 500 through the shift 
register 502. The output of each register 504 is coupled by one of a series of filter taps 5 14 to 
one of a set of multipliers 508, 510, and 512 to be multiplied by a coefficient produced by the 
coefficient storage unit 516. A new set of coefficients is entered into the multipliers 508, 510, 
and 512 by the coefficient storage unit 516 on each cycle of clock 506. The results from each 
multiplier 508, 510, and 512 are summed by the adder 51 8 to produce a filtered output sample. 

Figure 23 is a graph of low-pass filter coefficients 520 in the time domain stored in the 
coefficient storage unit 516 to produce coefficients. The low-pass filter coefficients 520 are 
represented by the equation below. 

< 1 > = v!/ /C )M * sin[2 fc^i - 1/2)]* {0.54 + 0.46 cos[2*(i - 1/2)/^]} 

m 2fcx{t - 1/2) 

The unquantized and continuous wave is represented by a curve 522. Filter coefficients 
524 are shown plotted on or near the curve 522. Some coefficients 524 appear slightly off the 
curve due to the error introduced by quantizing each coefficient to a limited number of bits. 

Figure 24 shows the coefficients 524 organized into L=8 sets of mults=6 coefficients per 
set. The sum of all the coefficients in each set i where i-1 to L is represented by the equation 
below. 

J q muia 

(2)5(i)= 5>(IO'-l) + i) 

Figure 25 is a flow chart of a quantization method 526 in accordance with one 
embodiment of the present invention. The method 526 initializes with a given set of parameters 
528 needed to compute the coefficients where L is the numerator of the scaling ratio L/M; mults 
is the number of multipliers used in the FIR filter; and n is the number of bits to which the 
coefficients will be quantized. An operation 530 computes the FIR filter coefficients using 
equation 1. In an operation 532, the coefficients are organized from left to right and labeled c(l), 
c(2), c(3),...c(L * mults). 

In an operation 534, each coefficient is quantized to n number of bits by rounding. Next, 
an operation 536 starts a loop which is executed L times, one time for each coefficient set, in 
which all the coefficients in each set are summed. An operation 538 sums the coefficients for 
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set(i) represented in Figure 24. Then, an operation 540 tests the result of the summing operation 
538 for a 1 .0 result. If operation 540 produces a true result, then no further processing is done. 

The loop iterator is incremented in an operation 542, and control passes back to operation 
536. If operation 540 produces a false result, then a Fudge value F is computed in an operation 
5 544 by subtracting the sum produced in operation 538 from 1.0. Then, processing proceeds to an 
operation 546 in which one of the coefficients of set(i) is changed so that the sum of all the 
coefficients in set(i) = 1 .0. 

Figure 26 is a flow chart of the operation 546 from Figure 25 in much greater detail. An 
operation 548 is a loop set up to step through the coefficients of set(i) in a particular order. The 
1 0 order starts with the outermost coefficient of the set(i), and then moves toward the center of the 
set. Operation 548 is executed mults times, because there are mults number of coefficients per 
set. Next, an index k is computed in an operation 550, which is used to process the coefficients 
in the previously stated order. 

The coefficients at the left or right edge of the coefficient set must be handled as a special 
15 case. Therefore, an operation 552 is performed on the index k to determine whether the 
coefficient to be processed is either the first coefficient, c(l), or the last coefficient, c(L * mults). 
If operation 552 determines that the coefficient to be adjusted is the leftmost one, that is, c(l), 
then an operation 554 is performed. 

Operation 554 evaluates whether the absolute value of the sum of c(l) and F is less than 
20 or equal to the absolute value of the coefficient to the right of c(l). This means that c(k+l) < 
c(2). If the result is true, then c(l) can be adjusted by adding F without creating a discontinuity 
or divergence from the zero axis. The coefficient is adjusted in an operation 564, and operation 
546 is exited successfully. If the result is false, then operation 560 performs a loop iteration. 

If operation 552 determines that the coefficient to be adjusted is the rightmost one, that is, 
25 c(L * mults), then the method proceeds to an operation 556. Operation 556 evaluates whether 
the absolute value of the sum of c(L * mults) and F is less than or equal to the absolute value of 
the coefficient to the left of c(L * mults), that is, c(L * mults - 1). If the result is true, then c(L * 
mults) can be adjusted by adding F without creating a discontinuity or divergence from the zero 
axis. The coefficient is adjusted in 564, and operation 546 is exited successfully. If the 
30 operation 556 result is false, then a loop iteration is performed in operation 560. 
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If operation 552 determines that the coefficient to be adjusted is neither the leftmost or 
nghtmost one, then an operation 558 is performed. Operation 558 evaluates whether the sum of 
c(k) and F is outside the limits of the coefficients on the left and right, that is c(k - 1) and c(k + 
1), by evaluating the equations c(k-l) < c(k) < c(k + I) and c(k-l) > c(k) > c(k + l). If either of the 
equanons ,s true, then the coeffiden, c(k) is set equal to c(k) + F in operation 564 and a 
d.scontinuity is not introduced. Therefore, operat.on 546 is successfully exited. If either of the 
equations is false, then a loop iteration is performed in operation 56<X 

Operation 560 increments the loop iterator variable so the next coefficient can be 
evaluated. An operation 562 asks whether all the coefficients in the set(i) have been evaluated. 
If all the coefficients in set(i) have not been evaluated, then control is passed to the top of the 
operation 548 and the procedure is repeated for the next coefficient. If all the coefficients in 
set(i) have been evaluated, then the coefficient set cannot be quantized to n bits without 
introducing an unacceptable discontinuity into the coefficient set. Therefore, n is incremented in 
an operation 566 and operation 546 is exited. Control is then returned to the method 526 at 
operation 534 (in Figure 25) where the original coefficients are quantized to the new value of n 
and the process is repeated. In the cases where operation 546 is successfully exited, control is 
returned to the method 526 (in Figure 25) at the operation 538 and the next coefficient Se t(i) is 
evaluated. 

The two-dimensional chip architecture 50 and the addressing and sequencing FIFOs 90 of 
Figure 4 are discussed in greater detail with reference to Figures 27-33. Figure 27 illustrates a 
video frame 600 in accordance with one embodiment of the present invention subdivided into a 
number of vertical slices 602 for a slice scanning sequence exemplified by a corresponding 
number of scan lines 604. Each slice 602 is scanned in a format similar to that used in a 
conventional raster-scanned sequence, with the scanning sequence proceeding to the subsequent 
slice when the end of a given slice is reached. The advantage of this format is that the length of 
the line memories is reduced by a factor roughly equal to the number of vertical slices used. 
Line memories are still necessary, but they are now much shorter than before, resulting in a 
much reduced on-chip memory requirement. For instance, if the number of slices were set to 10, 
the reduction in on-chip memory would be by an entire order of magnitude. 

However, difficulties do arise from utilizing this "slice" scan organization. First, it is 
often the case that processing must simultaneously be done in both the horizontal and vertical 
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directions. This r«u„s i„ a problem on the m „ righ , ^ 
data ouis.de ih= slice may no, be available. S*o„d, the conventional raster-scan sequencing has 
been changed, resulting U, a poieniial incompatibility with common video sources and 
d.sp.a y /s.orage devices. Both of these problems will be addressed in the Mowing discussion as 
solved by the present invention. 

Figure 28 illustrates an example of a slice core 606 that has a problem with unavailable 
data on its left edge 608 and right edge 610. For purposes of illustration, unavailable data is 
shown only on the left edge in Figure 28. Video processing requires that data surrounding a 
given puel be available in both the horizontal and vertical directions (in this case 5x5 matrices 
612 and 614 centered on the pixel). 

Processing matrix 6 1 2 resides in the center of th. slice core 606, so .here is no problem 
w.th availability of data because it is available in both horizontal and vertical directions on al, 
sides of processing man*. 612. The situation a. the top edge 618 and bottom edge 620 of the 
sl.ee core 606, where cat, above the .op-mos, pixel and da* below ,„. bottom-mos. pixel is no, 
avatlable, is identica. to that with the conventional raster-scanned format. This can be soived in a 
number of ways, such as substituting zero data for the nonexistent upp,^ pixel ^ 

Therefore, the top and bottom edges 618 and 620 of the slice cor. 606 will no, cause problems 

with unavailable data. 

In contrast, processing matrix 614 is on the left edge 608 of the slice core 606 
honzontal.y adjacent data. Thus, two columns of pixel data 616 are missing because they are 
outs.de the left edge 608 of the slice core 606. To resolve this situation, data for these columns 
are provded from the slice immediately to the left of the slice being processed. 

Figure 29 illustrates a slice 622 that includes a pair of thin vertical slices or "wings" 624 
and 626 along the left and right edges 608 and 610. Wing 624 has been added to the slice core 
606 to provide the pixel data needed for the processing matrix. Wing 626 has been added to the 
nght edge 610 of the slice core 606. Because wing 624 has been added to slice 622, processing 
matnx 614 no longer suffers from the lack of data outside of the left edge 608 of slice 622. 

Figure 30 illustrates an overall structure of overling slice/wing combinations 628 
Shce 622 from Figure 29 is shown as an exemplary slice. Wings 624 and 626 of slice 622 are 
composed of data from a pair of adjacent slices, one to the left and one to the right of slice 622. 
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More ft. missing two len columns of pixels jn ^ ^ ^ ^ ^ 

ngh, most commns 630 of a s,ice 632 immediately ,. me M office 622. So in a sequence of 
s ces 634. the lefi-mos, wmg of slice N overlaps the core of sltce N-l, whi,e ft. rightist Wlng 

of slice N-l overlaps the core of slice N. 

Figure 3! is a flow char, iltas^g a memod 636 of processing video in accordance with 
one embody, „f ft, prKtnl invenlio „ ^ ^ , ^ ^ ^ 

sl,ce 622 with slice core 606, wing 624 and righ. wing 626. The M wing 624 is divided in,o 
a led o».er wing 638 a* , led toe, wing 640. The righ. wing 626 is divided in.o , righ, ou,er 
wtng 644 and a righ, inner wing 642. In .his example. 0,. video processing block ha, muldple 
processing s,.ges, each with its own requirement for horizontal pixels on each side of the cen,.r. 

The me,hod 636 utihzes a as, processing suge 646 a* a secom, processing suge 650 
The firs, processing suge 646 utilizes ami then removes me ou«r wings 638 am. 644 leaving an 
ournu, slice 648 consist of me sUce core 606 and the inner wings 640 and 642. The s*o„d 
processing suge 650 utilizes ^ ft, r^ves me inner wings 640 and 642. Therefore, ft, 
»p «4 and 626 are effectively removed in ft. processing and me output of me ptocessing 
block is a shce 652 with the width equal to the original slice core 606. 

On. effect of the wings 624 and 626, is to increase the on-chip slice-line memory 
regents by the width of ft, wings 624 and 626. However, the wing wtdth is typically smal. 
relative* the overall slice width. The acbaa, slice and wing width is implemenution dependent 
and wtll depend on processing requirement art available external memory bandwidth. 

A preferred embodiment of me present invention utilizes ftree vertical video processing 
blocks. The firs, prc^sstog suge 646 requires a pair of outer wings 638 and 644 having a width 
of 2 puds; the second processing suge 650 requires a pair of inner wings 640 and 642 with a 
mdth of 4 pixeU; ami the ft* processmg stage 652 requtres no wings as the specific pressing 
algonftm used does no, require data horizon*, ,„ me vertical dau being processed. The slice 
core wdlh chosen was 36 pixels, resulting i„ an initial inpu, slice width of 48 pixels (Core + 
left-mner-wing + righ,-,„ner.„i„g ♦ M-outer-wing + righ,M,„,er-wi„g - 36>4-4 + 2 + 2 - 48.) 

Unfort U „a,e,y, the dau mputs and outputs of the venical processing blocks are no. in the 
raster-scan video f orm at, which is stared to virtually a„ video inpu, sources and video ou,pu, 
dtsplay and storage devices. The presen, invention includes a standardized i„p„,oum u , forma, 



31 



WO 99/18727 

PCT/TJS98/2I069 

conversion, which is accompHsheo via u. use of a memory exumal ,„ *, video process,,,, 
devcc. A commodity DRAM memory device is used for reasons of cos, and avaiiability. 

Depending on the type of video processing to be done, a field or Same size buffers) 
serves other necessary purposes in addition ,o conversion between Ml fie-d/frame raster-scan 
and s„ce«can fonnats. For insunee, me defacing process typically requires ^ (soniMimK 
severa!) field buffers ,o store rnuhiple fields of video dau f„, temporal processing . Buffm „ 
also needeo in Same rate conversion, where me output Same rate is different man the input rate- 
» tins case muluple ourpu, field or frame buffers may be required for me frame rat. conversion' 
process. 

Figure 32 illustrates an example of a systom diagram for a slice-based video processor 
654. A firs, input buffer 656, a second input buffer 658, a firs, output buffer 660, and a second 
output ^ buffer 662 are required for the slice conversion process. Because video applicant 
typ-caily reou.re.real.ftne input and output, and because me scanning process for a conventions, 
raster-scan and a shcscan are differ™, the firs, inpu, buffer 656 is useo ,o store «, video inpu, 
data stream from the inpu. dau formaaer 664. The second mp„, buffer 658 (mied fa fc 

prevous field/frame period) is used to provide dau to the veruc* video processing section 666 

in a shce-scan format. 

i 

A similar process , s used for output. The second output buffer 662 reccves processed 

buffer 660 fi„.d m .h. previous fieldW pen'*) is us* to output dau in tne convent 
r^r-scan forma, to me „ Mp „, dau formaaer 668. The output da* stream may actually provide 
data to add^na! video processing stages tua, process dau in the borizonul direction only (eg 
nonzonul scaling and color space conversion). 

Figure 33 aiusBa.es a sys,em diagram of one example of a video processing chip 
arch^mre 670. The video processing chip archi,ec,ure 6,0 includes a video processor 672 and 
an ex,emal memory source 674. In Ms parncuiar video processing implemenuUon, mu.,iple 
tnpu. fie d storage (for .empora, processing, is quired. Video data is provided to an inpu, suge 
676 m d,e v,deo processor 672 mat adds me redundant wing dau direc.ly i„,„ „ e video dau 
stream. The data is men written (wtngs included) in a raster-scan sequence to a firs, field 
memory buffer 678 in me exuma, memory source 674 by .he memory controller 680 which is 
located inside the video processor 672. 
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In subsequent field periods, data is written to a second field memory buffer 682, a third 
field memory buffer 684, and a fourth 686 field memory buffer in sequence. Next, data is read 
from the second, third and fourth field memory buffers 682. 684, and 686, all of which are in the 
external memory source 674 and connected in parallel. The field buffers 682, 684, and 686 feed 
the vertical video processing section 688 that is located inside the video processor 672. The data 
is processed in the vertical video processing section 688, which removes the wings. 

Data is written from the vertical video processing section 688 in a slice-scan format back 
to a first frame buffer area 690 in the external memory source 674. Data is read from a second 
frame buffer area 692 in the external memory source 674 in a conventional raster-scan sequence 
for input to a horizontal processing block 694 located in the video processor 672. The output of 
the horizontal processing block 694 is in raster-scan format and is the output of the video 
processor 672. 

In one preferred embodiment of the present invention, video input data is provided as 
interlaced fields of data in a 720x240 pixel field format. Each video field is conceptually broken 
into 20 slices having a width 36 pixels, with each slice having left and right wings of 6 pixels 
each (outer wings of 2 pixels each and inner wings of 4 pixels each). The wings are added at the 
appropriate points in the video input data stream, and the resulting data stream is written in 
raster-scan sequence into a first field buffer in an external SDRAM. 

Three fields of data are read from the SDRAM simultaneously. The data for these fields 
is sourced by second, third, and fourth field buffers and is read in vertical slices 48 pixels wide 
(slice core and wings) by 240 rows. The data is processed by a first vertical processing stage that 
provides slice-scan format data at twice the input rate of a single field to a second stage. Slice 
data input to the second stage is formatted in slices of 44 pixels wide by 480 rows (due to the 
rate-doubling action of the first stage). The second vertical processing stage processes the data 
and provides 36 pixel wide slice-scan format data at the same rate as the input to that stage to a 
third vertical processing stage. 

The third stage is a vertical scaler and performs no horizontal processing, and so does not 
require wings on the slice format data. Data is output from the third processing stage in a 36 
pixel wide slice-scan format to a first frame buffer area in the SDRAM. The number of rows in 
each slice is dependent on the specific vertical scaling ratio chosen. Data is input to a horizontal- 
only processing stage in conventional raster scan format of 720 x 480*M pixels, where M is the 
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vertical scaling factor in the third vertical processing stage. This data is processed by the 
horizontal processor (which includes a horizontal scaler) and is output in a conventional raster- 
scan format at a resolution of 720*N x 480*N, where N is the horizontal scaling factor. 

Overall, this implementation results in a greater than 10X reduction in on-chip memory 
requirements due to the slice-scan architecture. This expense saved with the reduction in on-chip 
memory requirements more than offsets the additional required external memory, and provides a 
variety of prototyping and production options. 

The video output processor 60 of Figure 4 is discussed in greater detail with reference to 
Figures 34-38. Figure 34 is a diagram of an asynchronous interface 700 in accordance with one 
embodiment of the present invention. A data source 702 provides data synchronous to clock CI 
in an input data stream 704 to a synchronizer 705. Once the data has been received by the 
synchronizer 705, a write control logic (WCL) unit 706 directs the writing of this data into one of 
a first RAM buffer 708, a second RAM buffer 710, and a third RAM buffer 712 by providing 
control and address signals to each of the RAM buffers 708, 710, and 712. 

Once data has been written into the RAM buffers 708, 710, and 712, a read control logic 
(RCL) unit 714 directs the reading of data from the RAM buffers 708, 710, and 712 by providing 
address and control signals to the RAM buffers 708, 710, and 712. The retrieved data is then 
output from the synchronizer 705 in a continuous output data stream 716 at clock rate C2 to a 
data destination 718. 

Reading of data from a given RAM buffer 708, 710, or 712 does not start until that 
particular RAM buffer 708, 710, or 712 has been filled by the WCL unit 706. After one of the 
RAM buffers 708, 710, or 712 has been filled, the next RAM buffer 708, 710, or 712 is written 
to in sequence, and data can be read from the previously filled RAM buffi*) 708, 710, and/or 
712. Although this particular embodiment uses three RAM buffers it should be understood that 
more RAM buffers may. be used depending on the system requirements. 

A synchronization logic unit 720 external to the synchronizer 705 coordinates the reading 
and writing of data. Optionally, the synchronization logic 720 could be part of the synchronizer 
705 .tself. Multiple synchronization schemes may be implemented. For example the 
synchronization logic 720 could signal the WCL unit 706 and the data source 702 when to start a 
data transfer. A predetermined period later, when the first RAM buffer 708 has been filled and 
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the second RAM buffer 710 is in the process of being filled, the synchronization logic 720 
instructs the RCL unit 714 to begin reading data from the first RAM buffer 708 and to provide it 
to the data destination 718. 

An alternative scheme could have the data transfer be more or less free running. The 
WCL unit 706 would be accepting data from the data source 702 as it is available, writing that 
data to the next available RAM buffer 708, 710, or 712, and signaling the RCL unit 714 that the 
respective buffer is full. The "buffer full" signal must be synchronized from the CI to C2 clock 
domain, typically through a dual-rank synchronizer. The RCL unit 714 could either latch the 
state of the "buffer full" signal, or return a handshaking signal (which must be synchronized 
from the C2 to CI clock domains) back to the WCL unit 706 indicating that either the "buffer 
full" signal has been received or that the RAM buffer 708, 710, or 712 has been emptied. 

Figure 35 is a diagram of an alternative asynchronous interface 722. Data transfer is 
initiated via an external signal to the WCL unit 724 that indicates that a data transfer is to begin. 
The WCL 724, synchronous to clock CI, generates write enables and addresses for the a first 
single-ported RAM buffer 726, a second single-ported RAM buffer 728, and a third single-ported 
RAM buffer 730. The single-ported RAM buffers 726, 728, and 730 have synchronous write 
and asynchronous read capabilities. 

Only a single write enable is active at any one time, with the write enable for RAM buffer 
726 being asserted first, followed by that for RAM buffer 728, then RAM buffer 730, then back 
to RAM buffer 726, and so forth. At the point that RAM buffer 728 is full and RAM buffer 730 
is approximately half full, an external signal is asserted to the RCL unit 732 to initiate reading of 
the data from the RAM buffers 726, 728, and 730. The RCL unit 732, synchronous to clock C2, 
generates addresses and data select lines for the RAM buffers 726, 728, and 730 for read 
operations. 

Individual RAM buffers 726, 728, and 730 receive a read or write address based on 
which RAM buffer 726, 728, or 730 is currently being written to, /.£?.. one RAM buffer 726, 728, 
or 730 being written to receives its address from the WCL unit 724 while the other two RAM 
buffers 726, 728, and 730 receive their addresses from the RCL unit 732. A data MUX 734, 
controlled by the RCL unit 732, selects the output data stream from the one of the three RAMs 
currently being read from. Although in this example, external signals start the data transfer read 
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and write operations, an optional synchronization logic unit 736 may be present in the absence of 
one or more of the external "start" signals. 

Figure 36 is a diagram of a 3-buffer synchronizer sequence 738 illustrating the 
sequencing and relative relationships of read and write operations to three RAM buffers. 
Potential clock synchronization delay issues pertaining to real-time buffer sequencing for the 
continuous data output stream are mitigated by the fact that read and write operations are 
separated from each other by a skew 739 of approximately 1 54 RAM buffers. 

The skew 739 provides a buffer region of about one half of a RAM buffer separating 
potential collisions between read and write operations occurring in the same RAM buffer at the 
same time. This buffer region is there to absorb inaccuracies in the "start writing" and "start 
reading" commands, variations in data lull periods in the input data stream, and deviations in 
timing due to synchronization delays and/or offsets between the two clock domains. 

Figure 37 is a flow chart of a method 740 for sequencing through RAM addresses and 
modules in accordance with one embodiment of the present invention. The method 740 begins at 
an operation 742 in which the RAM address for both read and write operations is set to zero, as 
is the selected RAM buffer. Next, an operation 744 asks if the data is valid. If the answer is no, 
operation 744 repeats itself until data is valid. If the answer is yes, then the method 740 proceeds 
to an operation 746 which asks if a variable called EndCnt is equal to 1. If the answer is yes, 
then the last RAM module has been reached and an operation 748 increments to select the next 
RAM module before executing an operation 750. If the answer is no from operation 746, then 
operation 750 increments the RAM address. 

Next the method 740 proceeds to an operation 752 that determines whether the last RAM 
address has been reached. If it has, then EndCnt is set equal to 1 in an operation 754. If the last 
RAM address has not been reached, then EndCnt is set equal to 0 in an operation 756. From 
both operations 754 and 756, the method 740 loops back to operation 744 where the whole 
process starts again. Note that while read and write operations follow the same basic sequence, 
they are skewed in time by approximately one and one half RAM buffers, as noted previously. 

Figure 38 is a diagram of a synchronizer 758 in accordance with one embodiment of the 
present invention intended for use in a video scaling application: The input source generates an 
8-bit wide input data stream 760 for the synchronizer 758. The input data stream 760 runs at a 
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clock rate of 732 MHz (CI) with an average data rate of 48 MHz (C2). Three 8-bit wide by 16- 
element deep RAM buffers 762, 764, and 766 are used. A WCL unit 768 generates three RAM 
write-enable signals and a 4-bit RAM address. 

A video scaler produces a "data valid" signal indicating when valid pixel data is available 
on the input data port. Whenever data is valid at this port, it will be written into the RAM 
buffers 762, 764, and 766. The WCL unit 768 performs this write operation as described above. 
The WCL unit 768 is composed of a first counter 770 counting from 0 to 15 and producing the 
RAM address, and a second counter 772 (a ring counter) counting from 0 to 2 and producing the 
RAM select/write-enable lines. 

At approximately the point at which one and one half RAM buffers have been filled, an 
external signal is received which indicates that data output should begin. A RCL unit 774 then 
begins generating a series of addresses to the RAM buffers 762, 764, and 766 to extract the data 
previously written under control of the WCL unit 768. The RCL unit 774 is also composed of a 
third counter 776 counting from 0 to 15 and producing the RAM address, and a fourth counter 
778 counting from 0 to 2 and producing the data output through a MUX 780. 

The MUX 780 produces an output data stream 782 that must be continuous since it 
provides the real-time data stream for a raster-scanned display. Each scan line of video data is 
1280 8-bit data elements long, and a continuous data stream of this length must be produced for 
each output scan line. Logic in the synchronizer circuit recognizes the end of a horizontal scan 
line and resets the WCL unit 768 and RCL unit 774 at the end of a line in preparation for the 
next line of video data. 

While this invention has been described in terms of several preferred embodiments, it 
will be appreciated that those skilled in the art upon reading the preceding specifications and 
studying the drawings will realize various alterations, additions, permutations and equivalents 
thereof. It is therefore intended that the present invention includes all such alterations, additions, 
permutations, and equivalents as fall within the true spirit and scope of the invention. 

What is claimed is: 
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1 • A digital image enhancer comprising: 

a deinterlacing processor means receptive to an interlaced video stream, said 
deinterlacing processor means including a first deinterlacer and a second deinterlacer and 
providing a deinterlaced video stream; and 

a video output processor means receptive to said deinterlaced video stream to provide a 

scaled, deinterlaced video stream. 



2. A digital image enhancer as recited in claim 1 wherein said first deinterlacer is 
operative to analyze progressive frames of said interlaced video stream in an attempt to 
determine an original source type and sequencing used for the interlaced video stream. 



3. A digital image enhancer as recited in claim 2 wherein said first deinterlacer is further 
operative to convert said interlaced video stream into a deinterlaced video stream using a 
conversion process that is dependent upon said detection of said original source type and 

sequencing. 



4. A digital image enhancer as recited in claim 1 wherein said second deinterlacer is 
operative to reduce motion artifacts detected by a frequency analysis of said interlaced video 



stream. 



5. A digital image enhancer as recited in claim 1 wherein said second deinterlacer is 
operative to detect diagonal features and to smooth said detected diagonal features. 



6- A digital image enhancer as recited in claim 1 wherein said deinterlacing 
processor means processes said deinterlaced video stream in vertical slices. 



38 



WO 99/18727 PCT/US98/2 1 069 

7. A digital image enhancer as recited in claim 1 wherein said video output 
processor means is operative to scale said deinterlaced video stream to modify a video display 
output format of a video output stream. 



5 8. A digital image enhancer as recited in claim 1 wherein said video output 

processor means includes a data rate synchronizer between a first data rate of said deinterlaced 
video stream and a second data rate of a video output stream. 

9. A digital image enhancer comprising: 

10 a deinterlacing processor receptive to an interlaced video stream and operative to provide 

a deinterlaced video stream; and 

a video output processor receptive to the output of said deinterlacing processor, wherein 
said deinterlacing processor means processes said interlaced video stream in vertical slices to 
provide a scaled, deinterlaced video stream. 

15 

10. A digital image enhancer as recited in claim 9 wherein said deinterlacing 
processor is operative to analyze progressive frames of said interlaced video stream in an attempt 
to determine an original source type and sequencing used for the interlaced video stream. 

20 11. A digital image enhancer as recited in claim 10 wherein said deinterlacing 

processor is further operative to convert said interlaced video stream into a deinterlaced video 
stream using a conversion process that is dependent upon said detection of said original source 
type and sequencing. 

25 12. A digital image enhancer as recited in claim 9 wherein said deinterlacing 

processor is operative to reduce motion artifacts detected by a frequency analysis of said 
interlaced video stream. 



13. A digital image enhancer as recited in claim 9 wherein said deinterlacing 
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processor is operative to detect diagonal features and to smooth said detected diagonal features. 



14. A digital image enhancer as recited in claim 9 wherein said video output 
processor is operative to scale said deinterlaced video stream to modify a video display output 
format of a video output stream. 

15. A digital image enhancer as recited in claim 9 wherein said video output 
processor includes a data rate synchronizer between a first data rate of said deinterlaced video 
stream and a second data rate of a video output stream. 

16. A portable DVD player comprising: 

a generally thin prismatic enclosure having a first major surface, a second major surface 
separated from said first major surface, and side surfaces connecting said first major surface to 
said second major surface, wherein at least a portion of said first major surface includes a video 
display, and wherein said enclosure includes a DVD entry port such that a DVD can be inserted 
into said enclosure; and 

a digital processing system including a decoder, an image enhancement means, and a 
display controller where said decoder receives signals from a DVD inserted into said enclosure 
to provide a decoded, interlaced video signal, said image enhancement means converts said 
interlaced video signal to a deinterlaced video signal, and said display controller uses said 
deinterlaced video signal to provide progressively scanned video on said video display. 

17. A portable DVD player as recited in claim 16 wherein said digital processing 
system includes a microprocessor providing control signals to said decoder, said image 
enhancement means, and said display controller. 

1 8. A portable DVD player as recited in claim 16 further comprising a DVD transport 
mechanism associated with said port in said enclosure. 
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19. A portable DVD player as recited in claim 18 wherein said DVD transport 
mechanism comprises a drawer which extends from a side surface for the loading and unloading 
of a DVD and which retracts into said enclosure for the playing of said DVD. 



20. A portable DVD player as recited in claim 16 further comprising an infrared port 
associated with said enclosure and coupled to said digital processing system. 

21. A portable DVD player as recited in claim 20 further comprising an infrared 
remote control providing control commands to said DVD player via said infrared port. 

22. A portable DVD player as recited in claim 16 further comprising a docking 
station coupled to a video monitor, wherein said docking station includes a docking port 
receptive to at least a portion of said enclosure. 

23. A portable DVD player as recited in claim 16 further comprising shock isolation 
means for reducing the affect of physical shocks impinging upon said enclosure. 

24. A method for processing digital video comprising: 

deinterlacing an interlaced video stream by at least one of a number of deinterlacing 
methods to produce a deinterlaced video stream; and 

scaling said deinterlaced video stream. 

25. A method for processing digital video as recited in claim 24 wherein said 
deinterlacing methods include at least one of an original source detection method, a diagonal 
feature detection method, and a motion artifact detection method. 

26. A method for processing digital video as recited in claim 24 wherein said 
deinterlacing methods include processing said interlaced video stream in vertical slices. 
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27. A method for processing digital video as recited in claim 24 wherein said scaling 
includes a horizontal scaling of the deinterlaced video stream. 



5 28. A method for processing digital video as recited in claim 24 wherein said scaling 

includes a data rate synchronizer between a first data rate of said deinterlaced video stream and a 
second data rate of a video output stream. 
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If operation 552 determines that the coefficient to be adjusted is neither the leftmost or 
rightmost one, then an operation 558 is performed. Operation 558 evaluates whether the sum of 
c(k) and F is outside the limits of the coefficients on the left and right, that is c(k - 1) and c(k + 
1), by evaluating the equations c(k-l) < c(k) < c(k+I) and c(k-l) > c(k) > c(k+l). If either of the 
equations is true, then the coefficient c(k) is set equal to c(k) + F in operation 564 and a 
discontinuity is not introduced. Therefore, operation 546 is successfully exited. If either of the 
equations is false, then a loop iteration is performed in operation 560. 

Operation 560 increments the loop iterator variable so the next coefficient can be 
evaluated. An operation 562 asks whether all the coefficients in the set(i) have been evaluated. 
If all the coefficients in set(i) have not been evaluated, then control is passed to the top of the 
operation 548 and the procedure is repeated for the next coefficient. If all the coefficients in 
set(i) have been evaluated, then the coefficient set cannot be quantized to n bits without 
introducing an unacceptable discontinuity into the coefficient set. Therefore, n is incremented in 
an operation 566 and operation 546 is exited. Control is then returned to the method 526 at 
operation 534 (in Figure 25) where the original coefficients are quantized to the new value of n 
and the process is repeated. In the cases where operation 546 is successfully exited, control is 
returned to the method 526 (in Figure 25) at the operation S38 and the next coefficient set(i) is 
evaluated. 

The two-dimensional chip architecture 50 and the addressing and sequencing FIFOs 90 of 
Figure 4 are discussed in greater detail with reference to Figures 27-33. Figure 27 illustrates a 
video frame 600 in accordance with one embodiment of the present invention subdivided into a 
number of vertical slices 602 for a slice scanning sequence exemplified by a corresponding 
number of scan lines 604. Each slice 602 is scanned in a format similar to that used in a 
conventional raster-scanned sequence, with the scanning sequence proceeding to the subsequent 
slice when the end of a given slice is reached. The advantage of this format is that the length of 
the line memories is reduced by a factor roughly equal to the number of vertical slices used. 
Line memories are still necessary, but they are now much shorter than before, resulting in a 
much reduced on-chip memory requirement. For instance, if the number of slices were set to 10, 
the reduction in on-chip memory would be by an entire order of magnitude. 

However, difficulties do arise from utilizing this "slice" scan organization. First, it is 
often the case that processing must simultaneously be done in both the horizontal and vertical 
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directions. This results in a problem on the left and right slice boundaries where horizontal pixel 
data outside the slice may not be available. Second, the conventional raster-scan sequencing has 
been changed, resulting in a potential incompatibility with common video sources and 
display/storage devices. Both of these problems will be addressed in the following discussion as 
solved by the present invention. 

Figure 28 illustrates an example of a slice core 606 that has a problem with unavailable 
data on its left edge 608 and right edge 610. For purposes of illustration, unavailable data is 
shown only on the left edge in Figure 28. Video processing requires that data surrounding a 
given pixel be available in both the horizontal and vertical directions (in this case 5x5 matrices 
612 and 614 centered on the pixel). 

Processing matrix 612 resides in the center of the slice core 606, so there is no problem 
with availability of data because it is available in both horizontal and vertical directions on all 
sides of processing matrix 612. The situation at the top edge 618 and bottom edge 620 of the 
slice core 606, where data above the top-most pixel and data below the bottom-most pixel is not 
available, is identical to that with the conventional raster-scanned format. This can be solved in a 
number of ways, such as substituting zero data for the nonexistent upper/lower pixel data. 
Therefore, the top and bottom edges 618 and 620 of the slice core 606 will not cause problems 
with unavailable data. 

In contrast, processing matrix 614 is on the left edge 608 of the slice core 606, 
horizontally adjacent data. Thus, two columns of pixel data 616 are missing because they are 
outside the left edge 608 of the slice core 606. To resolve this situation, data for these columns 
are provided from the slice immediately to the left of the slice being processed. 

Figure 29 illustrates a slice 622 that includes a pair of thin vertical slices or t4 wings" 624 
and 626 along the left and right edges 608 and 610. Wing 624 has been added to the slice core 
606 to provide the pixel data needed for the processing matrix. Wing 626 has been added to the 
right edge 610 of the slice core 606. Because wing 624 has been added to slice 622, processing 
matrix 614 no longer suffers from the lack of data outside of the left edge 608 of slice 622. 

Figure 30 illustrates an overall structure of overlapping slice/wing combinations 628. 
Slice 622 from Figure 29 is shown as an exemplary slice. Wings 624 and 626 of slice 622 are 
composed of data from a pair of adjacent slices, one to the left and one to the right of slice 622. 
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